4 <110> APPLICANT: BEIJING INSTITUTE OF RADIATION MEDICINE et al

5 Shifu, ZHAO 

6 Yong, ZHANG 

7 Lili, YIN

9 <120> TITLE OF INVENTION: Glycine-Rich Proteins, Their Coding Genes and
Applications 

11 <130> FILE REFERENCE: JEEKP103US 
C--> 13 <140> CURRENT APPLICATION NUMBER: US/10/582,420 
C--> 13 <141> CURRENT FILING DATE: 2006-06-09 

13 <150> PRIOR APPLICATION NUMBER: CN200310117354.6

14 <151> PRIOR FILING DATE: 2003-12-11 
16 <160> NUMBER OF SEQ ID NOS: 14 

18 <210> SEQ ID NO: 1 

19 <211> LENGTH: 79 

20 <212> TYPE: PRT 

21 <213> ORGANISM: Homo sapiens 

23 <400> SEQUENCE: 1 

24 Met Pro Val Ala Val Gly Pro Tyr Gly Gln Ser Gln Pro Ser Cys Phe

25 1 5 10 15

26 Asp Arg Val Lys Met Gly Phe Val Met Gly Cys Ala Val Gly Met Ala

27 20 25 30

28 Ala Gly Ala Leu Phe Gly Thr Phe Ser Cys Leu Arg Ile Gly Met Arg

29 35 40 45

30 Gly Arg Glu Leu Met Gly Gly Ile Gly Lys Thr Met Met Gln Ser Gly

31 50 55 60

32 Gly Thr Phe Gly Thr Phe Met Ala Ile Gly Met Gly Ile Arg Cys

33 65 70 75

35 <210> SEQ ID NO: 2

36 <211> LENGTH: 240

37 <212> TYPE: DNA

38 <213> ORGANISM: Homo sapiens

40 <400> SEQUENCE: 2

41 atgccggtgg ccgtgggtcc ctacggacag tcccagccaa gctgcttcga ccgtgtcaaa 60

42 atgggcttcg tgatgggttg cgccgtgggc atggcggccg gggcgctctt cggcaccttt 120

43 tcctgtctca ggatcggaat gcggggtcga gagctgatgg gcggcattgg gaaaaccatg 180

44 atgcagagtg gcggcacctt tggcacattc atggccattg ggatgggcat ccgatgctaa 240

47 <210> SEQ ID NO: 3

48 <211> LENGTH: 80

49 <212> TYPE: PRT

50 <213> ORGANISM: Danio rerio

52 <400> SEQUENCE: 3

53 Met Pro Val Ser Val Gly Ser Tyr Gly Gln Gln Ala Gln Pro Ser Cys

54 1 5 10 15

55 Phe Asp Arg Val Lys Met Gly Phe Met Met Gly Phe Ala Val Gly Met

56 20 25 30

57 Ala Ala Gly Ala Met Phe Gly Thr Phe Ser Cys Leu Arg Ile Gly Met

58 35 40 45

59 Arg Gly Arg Glu Leu Met Gly Gly Val Gly Lys Thr Met Met Gln Ser

60 50 55 60

61 Gly Gly Thr Phe Gly Thr Phe Met Ala Ile Gly Met Gly Ile Arg Cys

62 65 70 75 80

64 <210> SEQ ID NO: 4 

65 <211> LENGTH: 128 

66 <212> TYPE: PRT 

67 <213> ORGANISM: Anopheles gambiae 
69 <400> SEQUENCE: 4

70 Tyr Tyr Tyr Val Ile Val Val His Cys Cys Asp Asn Thr His Phe Asn

71 1 5 10 15

72 Glu Phe Val Pro Lys Ile Lys Leu Pro Pro Arg Lys Arg Tyr Val Arg

73 20 25 30

74 Ser Gly Ser Phe Gln Ile Leu Gln Lys Thr Asp Thr Lys Ser Thr Met

75 35 40 45

76 Pro Ala Val Pro Gly Gly Val Tyr Ser Gln Asn Gln Gln Pro Ser Cys

77 50 55 60

78 Phe Asp Arg Met Lys Met Gly Phe Thr Ile Gly Phe Cys Val Gly Met

79 65 70 75 80

80 Ala Ser Gly Ala Leu Phe Gly Gly Phe Ser Ala Leu Arg Tyr Gly Leu

81 85 90 95

82 Arg Gly Arg Glu Leu Ile Asn Asn Val Gly Lys Val Met Val Gln Gly

83 100 105 110

84 Gly Gly Thr Phe Gly Thr Phe Met Ala Ile Gly Thr Gly Ile Arg Cys

85 115 120 125

87 <210> SEQ ID NO: 5 

88 <211> LENGTH: 79 

89 <212> TYPE: PRT 

90 <213> ORGANISM: Drosophila melanogaster

92 <400> SEQUENCE: 5 

93 Met Pro Leu Pro Thr Ser Ser Phe Ser Gln Gln Gly Pro Thr Cys Phe

94 1 5 10 15

95 Asp Lys Met Lys Thr Gly Phe Ile Ile Gly Phe Cys Val Gly Met Ala

96 20 25 30

97 Ser Gly Ala Val Phe Gly Gly Phe Ser Ala Leu Arg Tyr Gly Leu Arg

98 35 40 45

99 Gly Arg Glu Leu Ile Asn Asn Val Gly Lys Thr Met Val Gln Gly Gly

100 50 55 60

101 Gly Thr Phe Gly Thr Phe Met Ala Ile Gly Thr Gly Ile Arg Cys

102 65 70 75 

104 <210> SEQ ID NO: 6 

105 <211> LENGTH: 145 

106 <212> TYPE: PRT 

107 <213> ORGANISM: Caenorhabditis elegans 

109 <400> SEQUENCE: 6 

110 Met Pro Val Pro Ser Gly Tyr Ala Ala His Pro Gln Gly Ser Gln Pro

111 1 5 10 15

112 Ser Cys Phe Thr Lys Ile Arg Met Gly Leu Met Met Gly Ala Met Ile

113 20 25 30

114 Gly Gly Ala Thr Gly Ile Leu Leu Gly Gly Phe Met Gly Phe Arg Ala

115 35 40 45

116 Gly Met Arg Gly Lys Asp Leu Leu Leu Gln Thr Gly Lys Thr Val Ala

117 50 55 60

118 Gln Ser Gly Gly Ser Phe Gly Val Phe Met Gly Val Ala Gln Gly Leu

119 65 70 75 80

120 Arg Tyr Ile Phe Phe Lys Asn Leu Ala Gly Thr Gly Phe Trp Pro Phe

121 85 90 95

122 Ser Leu Asn Phe Ser Arg Ser Ile Asp Asn Cys Pro Arg Gly Lys Val

123 100 105 110

124 Val Tyr Ser Thr Arg Thr Asn Ala Phe Arg Phe Thr Thr Glu Ile Glu

125 115 120 125

126 Lys Lys Glu Pro Arg Arg Asp Thr Gln Arg Ala Val Asn Leu Pro Gln

127 130 135 140

128 Ile

129 145

131 <210> SEQ ID NO: 7

132 <211> LENGTH: 162

133 <212> TYPE: PRT

134 <213> ORGANISM: Caenorhabditis elegans

136 <400> SEQUENCE: 7

137 Met Gln His Thr His Lys Glu Ala Asn Arg Arg Val Leu Gln Arg Lys

138 1 5 10 15

139 Lys Ile Asn Leu Leu Glu Met Ser Asp Lys Ile Cys Arg Asn Leu Ile

140 20 25 30

141 Tyr Phe Gln Asn Phe Gln Ile Arg Met Gly Leu Met Met Gly Ala Met

142 35 40 45

143 Ile Gly Gly Ala Thr Gly Ile Leu Leu Gly Gly Phe Met Gly Phe Arg

144 50 55 60

145 Ala Gly Met Arg Gly Lys Asp Leu Leu Leu Gln Thr Gly Lys Thr Val

146 65 70 75 80

147 Ala Gln Ser Gly Gly Ser Phe Gly Val Phe Met Gly Val Ala Gln Gly

148 85 90 95

149 Leu Arg Tyr Ile Phe Phe Lys Asn Leu Ala Gly Thr Gly Phe Trp Pro

150 100 105 110

151 Phe Ser Leu Asn Phe Ser Arg Ser Ile Asp Asn Cys Pro Arg Gly Lys

152 115 120 125

153 Val Val Tyr Ser Thr Arg Thr Asn Ala Phe Arg Phe Thr Thr Glu Ile

154 130 135 140

155 Glu Lys Lys Glu Pro Arg Arg Asp Thr Gln Arg Ala Val Asn Leu Pro

156 145 150 155 160

157 Gln Ile

159 <210> SEQ ID NO: 8

160 <211> LENGTH: 120

161 <212> TYPE: PRT

162 <213> ORGANISM: Schizosaccharomyces pombe

164 <400> SEQUENCE: 8

165 Met Gln Ser Met Gln Pro Ser Thr Val Asp Lys Leu Lys Met Gly Ala

166 1 5 10 15

167 Ile Met Gly Ser Ala Ala Gly Leu Gly Ile Gly Phe Leu Phe Gly Gly

168 20 25 30

169 Val Ala Val Leu Arg Tyr Gly Pro Gly Pro Arg Gly Phe Leu Arg Thr

170 35 40 45

171 Leu Gly Gln Tyr Met Leu Thr Ser Ala Ala Thr Phe Gly Phe Phe Met

172 50 55 60

173 Ser Ile Gly Ser Val Ile Arg Asn Glu Asp Ile Pro Leu Ile Gln Gln

174 65 70 75 80

175 Ser Gly Ser His Trp Asn Gln Arg Leu Leu Asn Glu Asn Ala Asn Ser

176 85 90 95

177 Ser Arg Ile Phe Ala Leu Ala Met Gln Gln Ala Lys Ser Ser Pro Arg

178 100 105 110

179 Lys Ser Asn Glu Val Ala Glu Cys

180 115 120

182 <210> SEQ ID NO: 9

183 <211> LENGTH: 113

184 <212> TYPE: PRT

185 <213> ORGANISM: Saccharomyces cerevisiae

187 <400> SEQUENCE: 9

188 Met Pro Pro Leu Pro Gln Asn Tyr Ala Gln Gln Gln Pro Ser Asn Trp

189 1 5 10 15

190 Asp Lys Phe Lys Met Gly Leu Met Met Gly Thr Thr Val Gly Val Cys

191 20 25 30

192 Thr Gly Ile Leu Phe Gly Gly Phe Ala Ile Ala Thr Gln Gly Pro Gly

193 35 40 45

194 Pro Asp Gly Val Val Arg Thr Leu Gly Lys Tyr Ile Ala Gly Ser Ala

195 50 55 60

196 Gly Thr Phe Gly Leu Phe Met Ser Ile Gly Ser Ile Ile Arg Ser Asp

197 65 70 75 80

198 Ser Glu Ser Ser Pro Met Ser His Pro Asn Leu Asn Leu Gln Gln Gln

199 85 90 95

200 Ala Arg Leu Glu Met Trp Lys Leu Arg Ala Lys Tyr Gly Ile Arg Lys

201 100 105 110

202 Asp

204 <210> SEQ ID NO: 10

205 <211> LENGTH: 74

206 <212> TYPE: PRT

207 <213> ORGANISM: Arabidopsis thaliana

209 <400> SEQUENCE: 10

210 Met Ala Lys Asn Ser Cys Leu Ala Lys Ile Thr Ala Gly Val Ala Val

211 1 5 10 15

212 Gly Gly Ala Leu Gly Gly Ala Val Gly Ala Val Tyr Gly Thr Tyr Glu

213 20 25 30

214 Ala Ile Arg Val Lys Val Pro Gly Leu His Lys Val Arg Phe Ile Gly

215 35 40 45

216 Gln Thr Thr Leu Ser Ser Ala Ala Ile Phe Gly Leu Phe Leu Gly Ala

217 50 55 60

218 Gly Ser Leu Ile His Cys Gly Lys Gly Tyr

219 65 70

221 <210> SEQ ID NO: 11

222 <211> LENGTH: 168

223 <212> TYPE: PRT

224 <213> ORGANISM: Plasmodium falciparum 3D7

226 <400> SEQUENCE: 11

227 Met Met Asn Trp Phe Arg Lys Lys Asp Ser Asn Glu Asn Lys Lys Val

228 1 5 10 15

229 Lys Ser Glu Tyr Asp Glu Tyr Val Thr Pro Pro Pro Phe Gly Asn Tyr

230 20 25 30

231 Leu Val Ser Glu Pro Lys Lys Pro Lys Ser Leu Lys Asn Asp Lys Thr

232 35 40 45

233 Ala Ile Thr Glu Phe Lys Gly Phe Thr Pro Pro Pro Lys Phe Glu Phe

234 50 55 60

235 Lys Glu Asp Ile Ser Asp Asn Lys Tyr Glu Glu Asp Phe Ser Lys Tyr

236 65 70 75 80

237 Thr Ser Asn Asn Ile Ile Asp Ser Ser Phe Tyr Asp Asp Lys Lys Lys

238 85 90 95

239 Leu Ser Asp Val Asn Leu Ser His Arg Thr Arg Ala Cys Phe Glu Ser

240 100 105 110

241 Ile Lys Met Gly Val Lys Met Gly Thr Met Val Gly Gly Ile Phe Gly

242 115 120 125

243 Ser Leu Thr Gly Ile Tyr Ala Ser Phe Ala His Lys Asn Leu Phe Ile

244 130 135 140

245 Leu Pro Val Ser Val Leu Gly Gly Ala Val Ser Phe Gly Phe Phe Leu

246 145 150 155 160

247 Gly Cys Gly Met Ile Val Arg Cys

248 165

250 <210> SEQ ID NO: 12

251 <211> LENGTH: 167

252 <212> TYPE: PRT

253 <213> ORGANISM: Plasmodium yoelii yoelii

255 <400> SEQUENCE: 12

256 Met Met Asn Trp Phe Lys Lys Lys Glu Thr Thr Glu Glu Pro Gln Val

257 1 5 10 15

258 Lys Ser Glu Tyr Asp Ser Tyr Val Thr Pro Pro Pro Phe Gly Asn Tyr

259 20 25 30

260 Leu Ala Lys Lys Pro Glu Lys Pro Lys Ser Leu Lys Asn Glu Lys Ile

261 35 40 45

262 Asn Val Thr Glu Phe Lys Gly Phe Thr Pro Pro Pro Lys Phe Glu Phe

263 50 55 60

264 Lys Glu Asp Thr Thr Asp Thr Gln Tyr Asp Gln Asp Phe Ser Lys Tyr

265 65 70 75 80

266 Thr Asn Asn Asn Phe Ile Asp Ser Ser Phe Tyr Asp Asp Lys Pro Asn

267 85 90 95

268 Met Phe Asp Phe Thr Leu Ser His Arg Thr Lys Ala Cys Leu Glu Ser

269 100 105 110